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ABSTRACT 

[Abstract of the Disclosure] 

An information storage medium on which subtitles for supporting multiple 
5 languages using text data and downloadable fonts are recorded and an apparatus 
therefor are provided. The information storage medium on which video data is 
recorded, comprises a plurality of clips that are recording units in which video data is 
stored; and text data for subtitles which is recorded separately from the clips and are to 
be overlapped with an image based on video data and then output, wherein the text 
10 data for subtitles includes data for providing subtitles in at least one language. 
According to the information storage medium, more diverse subtitle can be provided to 
users and the arrangement of recording space can be conveniently performed. 

[Representative Drawing] 
15 FIG. 10 
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SPECIFICATION 

[Title of the Invention] 

Information Storage Medium Containing Subtitle Data for Multiple Languages 
Using Text Data and Downloadable Fonts and Apparatus Therefor 

[Brief Description of the Drawings] 

FIG. 1 is a diagram of a data structure for a DVD; 

FIG. 2 is a detailed diagram of a VMG area; 

FIG. 3 is a detailed diagram of a VTS area; 

FIG. 4 is a detailed diagram of a VOBS that is video data; 

FIG. 5 is a diagram showing the relation between an SPU and SP_PCK; 

FIG. 6 is a diagram showing run-length encoding of an SPU; 

FIG. 7 is a diagram of the data structure of SP_DCSQT; 

FIG. 8 is a reference diagram showing an output situation with sub picture data 
considered; 

FIG. 9 is a block diagram of a reproducing apparatus according to a preferred 
embodiment of the present invention; 

FIG. 10 is a diagram of the data structure of text data stored in an information 
storage medium according to the present invention; 

FIG. 11 is a preferred embodiment of text data for subtitles according to the 
present invention; 

FIG. 12 is a diagram of the data structure of text data for a language other than 
the language of FIG. 1 1 ; 

FIG. 13 is an example of a text file used in the present invention; 

FIG. 14 is an example of a subtitle to which a different style is applied; 

FIG. 15 is an example of a subtitle displayed after changing a line; 

FIG. 16 is an example showing a case where a user executes a language 
change while subtitles in a predetermined language are being reproduced; 

FIG. 17 is an example of a plurality of language sets of subtitle data and font data 



for multiple languages; 

FIG. 18 is a diagram showing correlations of PlayList, Playltem, clip information, 
and a clip; 

FIG. 19 is an example of a directory structure according to the present invention; 
5 FIG. 20 is an example showing a case where a reproducing apparatus outputs 

only subtitle data; 

FIG. 21 is an example showing a case where a reproducing apparatus outputs 
subtitle data for more than one language at the same time; 

FIG. 22 is an example showing a case where during reproduction of only subtitle . 
10 data, normal reproduction of video data begins from video data corresponding to subtitle 
line data; and 

FIG. 23 is a block diagram of a recording apparatus according to the present 
invention. 

15 [Detailed Description of the Invention] 
[Object of the Invention] 

[Technical Field of the Invention and Related Art prior to the Invention] 

The present invention relates to an information storage medium on which 
subtitles for supporting multiple languages using text data and downloadable fonts are 
20 recorded and an apparatus therefor. 

Conventional digital versatile discs (DVD) use a bitmap image as a subtitle. 
Subtitle data of bitmap images are losslessly coded and recorded on a DVD, on which a 
maximum of 32 subtitles can be recorded. 

The data structure of video data on a DVD, which is one of the several types of 
25 conventional multimedia information storage media, will now be explained. 

FIG. 1 is a diagram of a data structure for a DVD. 

Referring to FIG. 1 , the disc space of a DVD that is a multimedia storage medium 
is divided into a VMG area and a plurality of VTS areas. Title information and 
information on a title menu are stored in the VMG area, and information on the title is 
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stored in the plurality of VTS areas. The VMG area comprises 2-3 files and each VTS 
area comprises 3-12 files. 

FIG. 2 is a detailed diagram of a VMG area. 

Referring to FIG. 2, the VMG area includes a VMGI area storing additional 
5 information on the VMG, a VOBS area storing video information (video object) on the 
menu, and a backup area for the VMGI. These areas exist as one file and among 
them the presence of the VOBS area is optional. 

In the VTS area, information on a title, which is a reproduction unit, and a VOBS, 
which is video data, are stored. In one VTS, at least one title is recorded. 
10 FIG. 3 is a detailed diagram of a VTS area. 

Referring to FIG. 3, a VTS area includes video title set information (VTSI), a 
VOBS that is video data for a menu screen, a VOBS that is video data for a video title 
set, and backup data of the VTSI. The presence of the VOBS for displaying a menu 
screen is optional. Each VOBS is again divided into VOBs and cells that are recording 
15 units. One VOB comprises a plurality of cells. The lowest recording unit mentioned in 
the present invention is a cell. 

FIG. 4 is a detailed diagram of a VOBS that is video data. 

Referring to FIG. 4, one VOBS comprises a plurality of VOBs, and one VOB 
comprises a plurality of cells. A cell comprises a plurality of VOBUs. A VOBU is data 

20 coded by a moving pictures expert group (MPEG) method of coding moving pictures 
used in a DVD. According to the MPEG method, since images are spatiotemporal 
compression encoded, in order to decode an image, previous or following images are 
needed. Accordingly, in order to support a random access function by which 
reproduction can be started from an arbitrary location, intra encoding which does not 

25 need previous or following images is performed for every predetermined image. This 
image is referred to as an infra picture or I picture in the MPEG and those between an I 
picture and the next I picture are referred to as a group of pictures (GOP). Usually, a 
GOP comprises 12-15 pictures. 

Meanwhile, the MPEG defines system encoding (ISO/IEC13818-1) for 
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encapsulating video data and audio data into one bitstream. The system encoding 
defines two multiplexing methods, including a program stream (PS) multiplexing method 
which is suitably for producing one program and storing the program in an information 
storage medium, and a transport stream multiplexing method which is appropriate for 
5 making and transmitting a plurality of programs. In the methods, the DVD employs the 
PS encoding method. According to the PS encoding method, video data and audio 
data are respectively divided in the units of packs (PCK) and are multiplexed through 
time division of the packs. Data other than the video and audio data defined by the 
MPEG are named as a private stream and also included in PCKs so that the data can 

10 be multiplexed together with the audio and video data. 

A VOBU comprises a plurality of PCKs. The first PCK in the plurality of PCKs is 
a navigation pack (NV_PCK). Then, the remaining part comprises video packs 
(V_PCK), audio packs (A_PCK), and sub picture packs (SP_PCK). Video data 
contained in a video pack comprises a plurality of GOPs. 

15 The SP_PCK is for 2 dimensional graphic data and subtitle data. That is, in the 

DVD, subtitle data that appear overlapping a video picture are coded by the same 
method as used for 2 dimensional graphic data. That is, for the DVD, a separate 
coding method for supporting multiple languages is not employed and after converting 
each subtitle data into graphic data, the graphic data is processed by one coding 

20 method and then recorded. The graphic data for a subtitle is referred to as a sub 
picture. A sub picture comprises a sub picture unit (SPU). A sub picture unit 
corresponds to one graphic data sheet. 

FIG. 5 is a diagram showing the relation between an SPU and SP_PCK. 
Referring to FIG. 5, one SPU comprises a sub picture unit header (SPUH), pixel 

25 data (PXD), and a sub picture display control sequence table (SP_DCSQT), which are 
divided and recorded in this order into a plurality of 2048-byte SP_PCKs. At this time, 
if the last data item of the SPU does not completely fill one SP_PCK, the remaining part 
of the last SP_PCK is filled by padding. Accordingly, one SPU comprises a plurality of 
SP PCKs. 
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In the SPUH, the size of the entire SPU and a location from which SP_DCSQT 
data begins are recorded. PXD data is obtained by encoding a sub picture. Pixel 
data forming a sub picture can have 4 different types of values, which are a background, 
a pattern pixel, an emphasis pixel-1 , and an emphasis pixel-2 that can be expressed by 
5 2 bit values and have binary values of 00, 01, 10, and 11, respectively. Accordingly, a 
sub picture can be deemed as a set of data having the four pixel values and formed with 
a plurality of lines. Encoding is performed for each line. As shown in FIG. 6, the SPU 
is run-length encoded. That is, if 1-3 predetermined pixel data items continue, the 
number of continuous pixels (No_P) is expressed by 2 bits and after that, a 2-bit pixel 

10 data value (PD) is recorded. If 4-15 pixel data items continue, the first 2 bits are 
recorded as 0's, then No_P is recorded by using 4 bits, and PD is recorded by using 2 
bits. If 16-63 pixel data items continue, the first 4 bits are recorded as 0's, then No_P 
is recorded by using 8 bits, and PD is recorded by using 2 bits. If pixel data items 
continue to the end of a line, the first 14 bits are recorded as 0's, and then PD is 

15 recorded by using 2 bits. If alignment in units of bytes is not achieved when encoding 
of a line is finished, 4 bits are recorded as 0's. The length of encoded data in one line 
cannot exceed 1440 bits. 

FIG. 7 is a diagram of the data structure of SPJDCSQT. 

Referring to FIG. 7, SP_DCSQT contains display control information for 
20 outputting the PXD data. The SP_DCSQT comprises a plurality of sub picture display 
control sequences (SP_DCSQ). One SP_DCSQT is a set of display control 
commands (SP_DCCMD) performed at one time, and comprises SP_DCSQ_STM 
indicating a start time, SP_NXT__DCSQ_SA containing information on the location of the 
next SP_DCSQ, and a plurality of SP_DCCMD. 
25 The SP_DCCMD is control information on how the pixel data (PXD) and video 

pictures are combined and output, and contains pixel data color information, information 
on contrast with video data, and information on an output time and a finish time. 

FIG. 8 is a reference diagram showing an output situation considering sub picture 

data. 
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Referring to FIG. 8, pixel data itself is losslessly coded as PXD. SP_DCSQT 
contains information on an SP display area, which is a sub picture display area in which 
a sub picture is displayed in a video display area that is a video image area, and 
information on the start time and finish time of output. 
5 . In a DVD, sub picture data for subtitle data of a maximum of 32 different 

languages can be multiplexed with video data and recorded. Distinction of these 
different languages is performed by a stream id provided by the MPEG system 
encoding and sub stream id defined in the DVD. Accordingly, if a user selects one 
language, SPUs are extracted from only SP_PCKs having stream id and sub stream id 

10 corresponding to the selected language, then decoded, and subtitle data are extracted. 
Then, output is controlled according to display control commands. 

Many problems arise from the fact that subtitle data are multiplexed together 
with video data as described above. 

First, the amount of bits to be generated for sub picture data should be 

15 considered when video data are coded. That is, since subtitle data is converted into 
graphic data and processed, the amount of generated data for respective languages are 
different from each other and also the amounts are huge. Usually, after encoding of 
moving pictures is performed once, sub picture data for each language is again 
multiplexed being added to the output of the encoding such that a DVD appropriate to 

20 each region is produced. However, depending on the language, the amount of sub 
picture data is huge such that when sub picture data is multiplexed with video data, the 
entire amount of generated bits exceeds a maximum allowance. In addition, since sub 
picture data is multiplexed between video data, the start point of each VOBU is different 
according to the region. Since the start point of a VOBU is separately managed, 

25 whenever a multiplexing process newly begins, this information should be updated. 

Secondly, since the contents of each sub picture cannot be known, sub picture 
data cannot be used for additional purposes, such as for outputting two languages at a 
time for a language by outputting only subtitle data. 
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[Technical Goal of the Invention] 

The present invention provides an information storage medium on which sub 
picture data is recorded with a data structure in which when video data are coded, the 
amount of bits to be generated for sub picture data need not be considered in advance 
5 and an apparatus therefor. 

The present invention also provides an information storage medium on which sub 
picture data is recorded with a data structure in which sub picture data can be used for 
purposes other than subtitles and an apparatus therefor. 

10 [Structure and Operation of the Invention] 

According to an aspect of the present invention, there is provided an information 
storage medium on which video data is recorded, comprising a plurality of clips that are 
recording units in which the video data is stored; and text data for subtitles which are 
recorded separately from the clips and are to be overlapped with an image based on the 

15 video data and to be output, wherein the text data for subtitles include data for providing 
subtitles in at least one language. 

The information storage medium further comprises font data of characters which 
are recorded separately from the clips and for graphic expression of the text data and 
are used in the text data for subtitles. 

20 In the information storage medium, the text data for subtitles is recorded in a 

separate space for each language if the text data is of multiple languages, and the text 
data for subtitles includes character data to be converted into graphic data and then to 
be output and output synchronization information for synchronizing with the video data. 
Preferably, the text data for subtitles includes character data to be converted into 

25 graphic data and then to be output and output location information indicating a location 
in which the graphic data is to be displayed when the graphic data is overlapped with an 
image based on the video data. Preferably, the text data for subtitles includes 
character data to be converted into graphic data and then to be output and information 
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for expressing the output of the graphic data in a plurality of sizes when this graphic 
data is overlaid with a video image. 

It is preferable that the video data is divided into units that should be reproduced 
continuously, and the size of the text data for subtitles, as a whole, corresponding to the 
5 one unit is limited. 

In the information storage medium, the video data is divided into units that should 
be continuously reproduced and comprise a plurality of reproducing units, and the text 
data for subtitles corresponding to the one reproducing unit is divided into a plurality of 
language sets and the size of the text data for subtitles, as a whole, forming one 
10 language set is limited. 

Preferably, the text data forming the text data for subtitles is expressed and 
recorded in Unicode for supporting multi-language character sets. If the text data for 
subtitles is formed only with characters of ASCII, which is a basic English character set, 
or IS08859-1, which is a Latin-extended character set, the text data is coded and 
15 recorded by using UTF-8 by which one character is coded into a plurality of 8-bit units. 
If the text data for subtitles includes a character having a code point value of a 2-byte 
size in Unicode, the text data is coded and recorded by using UFT-16 by which one 
character is coded into a plurality of 16-bit units. 

It is preferable that the information storage medium is a removable type that is 
20 easily placed on and removed from a reproducing apparatus, and the information 
storage medium is an optical disc from which the reproducing apparatus is capable of 
reading data by using an optical apparatus of the reproducing apparatus. 

According to another aspect of the present invention, there is provided a 
reproducing apparatus which reproduces data from an information storage medium on 
25 which video data is recoded, wherein on the information storage medium, video data is 
coded and divided into clips that are recording units and recorded in a plurality of clips, 
and text data for subtitles that are formed with data of a plurality of languages and are 
overlapped as graphic data with an image based on the video data and then output are 
recorded separately from the clips, the reproducing apparatus comprising: a data 
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reproducing unit which reads data from the information storage medium; a decoder 
which decodes the coded video data; a renderer which converts the text data for 
subtitles into graphic data; a blender which overlays the video data with the graphic data 
to generate an image; a buffer which temporarily stores the video data; and a buffer 
5 which stores the text data for subtitles. 

Preferably, on the information storage medium, font data of characters that are 
used in the text data for subtitles for graphic expression of the text data are recorded in 
a separate space, and the reproducing apparatus has a buffer for temporarily storing 
the font data, and the renderer has a function for converting the text data for subtitles 

10 into graphic data by using the font data. 

It is preferable that if the text data for subtitles is data of multiple languages, the 
text data is recorded in a separate space for each language, and the reproducing 
apparatus has a function by which the text data for subtitles for a language that is 
selected by a user or is set as an initial reproducing language in the reproducing 

15 apparatus is temporarily stored in the buffer; the font data for converting the text data for 
subtitles into graphic data is temporarily stored in the buffer for fonts; and while 
simultaneously reproducing video data, the text data for subtitles is converted into 
graphic data and the graphic data is output. 

On the information storage medium, the text data for subtitles including 

20 synchronization information, by which the text data for subtitles is converted into graphic 
data and the graphic data is overlapped with an image based on the video data and 
output, is recorded. Accordingly, the reproducing apparatus has a function for 
controlling an output start time and end time of the text data for subtitles, by \ 
synchronization information. 

25 On the information storage medium, the text data for subtitles, including 

character data, which will be converted into graphic data and be output, and output 
location information indicating a location where the graphic data is to be output when 
the graphic data is overlapped with an image based on the video and output, are 
recorded. Accordingly, the reproducing apparatus has a function for controlling a 
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location where the text data for subtitles are overlapped with an image based on the 
video data, by the output location information. 

Preferably, the video data recorded on the information storage medium is divided 
into units that should be reproduced continuously and within a limited size of the entire 
5 text data for subtitles corresponding to the recording unit, the text data is recorded; and 
the reproducing apparatus has a function by which all of the text data for subtitles 
whose size is limited is stored in the buffer before reproducing the units that should be 
reproduced continuously. If the language changes during reproduction, subtitle data 
corresponding to the language stored in the buffer is output. 

10 Preferably, the video data recorded on the information storage medium is divided 

into units that should be reproduced continuously; the text data for subtitles 
corresponding to the one unit is divided into a plurality of language sets; the text data for 
subtitles forming the one language set is recorded within a limited size of the entire text 
data; and the reproducing apparatus has a function by which the text data for subtitles 

15 corresponding to a language set containing the subtitle data which should be output 
simultaneously with the video data is all stored in the buffer before reproducing the unit 
that should be reproduced continuously. And, when language change occurs during 
reproduction, if the text data for subtitles for the language is in the buffer, the text data 
for subtitles for the language are output, and if the text data for subtitles for the 

20 language is not in the buffer, all of the text data for subtitles corresponding to the 
language set containing the text data for subtitles for the language is stored in the buffer 
and then output. 

On the information storage medium, the text data for subtitles containing 
character data, which will be converted into graphic data and be output, and information 
25 for indicating the output of a plurality of graphic data items when the graphic data is 
overlapped with an image based on the video data, are recorded. Accordingly, the 
reproducing apparatus has a function by which the size of the subtitle data is selected 
among the plurality of sizes in the size information by a user and is output. 
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On the information storage medium, text data forming the text data for subtitles 
are expressed and recorded in Unicode for supporting multi-language sets, and the 
renderer converts the characters expressed in the Unicode into graphic data. 

Preferably, on the information storage medium, if the text data for subtitles is 
5 formed only with characters of ASCII, which is a basic English character set, or 
IS08859-1, which is a Latin-extended character set, the text data is coded and recorded 
by using UTF-8 by which one character is coded into a plurality of 8-bit units, and the 
renderer converts characters expressed by the UFT-8 into graphic data. 

On the information storage medium, if the text data for subtitles includes a 
10 character having a code point value of a 2-byte size in the Unicode, the text data is 
coded and recorded by using UFT-16 by which one character is coded into a plurality of 
16-bit units, and the renderer converts characters expressed by the UTF-16 into graphic 
data. 

Preferably, the information storage medium is a removable type that is easily 
15 placed on and removed from a reproducing apparatus, and the reproducing apparatus 
reads and reproduces data on the removable information storage medium. 

It is preferable that the information storage medium is an optical disc from which 
the reproducing apparatus is capable of reading data by using an optical apparatus of 
the reproducing apparatus and the reproducing apparatus reads and reproduces data 
20 recorded on the optical disc. 

Preferably, the reproducing apparatus has a function for converting only the 
subtitle data into graphic data and outputting the graphic data without reproducing video 
data recorded on the information storage medium, or the reproducing apparatus has a 
function for reading and outputting subtitle data for more than one language if on an 
25 information storage medium, text data for subtitles of a plurality of languages is 
recorded. 

For better effects, the reproducing apparatus has a function by which during 
outputting the text data for subtitles, reproducing video data begins from video data 
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synchronized with corresponding subtitle data and subtitle data is overlapped with a 
video image and then output. 

According to still another aspect of the present invention, there is provided a 
recording apparatus which records video data on an information storage medium, 
5 comprising: a data writer which writes data on the information storage medium; an 
encoder which codes the video data output from the data writer; a subtitle generator 
which generates subtitle data that will be added to the video data and then output; a 
central processing unit (CPU); a fixed-type storage; and a buffer, wherein the encoder 
stores video data in the fixed-type storage after dividing video images into clips that are 

10 recording units, and compression encoding the clips; the subtitle generator generates 
subtitle data for a plurality of languages in the form of a text, the subtitle data that 
should be reproduced together with an image based on the video data, and stores the 
subtitle data in the fixed-type storage; the buffer temporarily stores the data stored in 
the fixed-type storage; the data writer records the coded video data and subtitle data 

15 that are temporarily stored in the buffer, in an information storage medium; and the CPU 
controls the encoding process of the video data, the process for generating the subtitle 
data and a process for recording the coded video data, and the text data for subtitles in 
respective separate areas on an information storage medium. 

Preferably, the recording apparatus further comprises a font data generator 

20 which generates font data for converting text data for subtitles into graphic data, 
wherein the font data generator generates font data needed for converting the text data 
for subtitles into the form of graphic data and stores the font data in the fixed-type 
storage, the buffer temporarily stores the font data stored in the fixed-type storage, the 
data writer records the font data temporarily stored in the fixed-type storage on an 

25 information storage medium, and the CPU controls the process for generating the font 
data and process for recording the font data in a separate area on the information 
storage medium. 

Preferably, the CPU controls the text data for subtitles to be recorded in a 
separate space for each language if the text data is data of multiple languages. 



13 



By including test data, which is to be converted into graphic data and then output, 
and output synchronization information for synchronizing with reproduction of the video 
images, the subtitle generator generates the subtitle data. 

By including text data, which is to be converted into graphic data and then 
5 output, and output location information indicating a location where the graphic data will 
be output when the graphic data is overlapped with an image based on the video data, 
the subtitle generator generates the text data for subtitles. 

By including text data, which is to be converted into graphic data and then 
output, and information for expressing the output of the graphic data with a plurality of 
10 sizes when the graphic data is overlapped with an image based on the video data, the 
subtitle generator generates the subtitle data. 

Preferably, the coded video data is divided into recording units that should be 
reproduced continuously, and the subtitle generator generates the text data for subtitles 
such that the size of the entire subtitle data corresponding to the recording unit is 
15 limited. 

Preferably, the coded video data is divided into recording units that should be 
reproduced continuously, and after the text data for subtitles corresponding to the 
recording unit is divided into a plurality of language sets, the subtitle generator 
generates the text data for subtitles such that the size of all of the subtitle data forming 
20 the one language set is limited. 

It is preferable that the subtitle generator generates text data forming the text 
data for subtitles in Unicode for supporting multi-language character sets. 

The subtitle generator generates performs encoding by using UTF-8 by which 
one character is coded into a plurality of 8-bit units if the text data for subtitles is formed 
25 only with characters of ASCII, which is a basic English character set, or IS08859-1, 
which is a Latin-extended character set. 

The subtitle generator performs encoding by using UFT-16 by which one 
character is coded into a plurality of 16-bit units if the text data for subtitles includes a 
character having a code point value of a 2-byte size in Unicode. 



14 



Preferably, the data writer records data on an information storage medium which 
is a removable type easily placed on and removed from the recording apparatus. 

Preferably, the data writer records data on an optical disc on which data is 
recorded by using an optical apparatus. 
5 To achieve the above objectives, on an information storage medium according to 

the present invention, each subtitle data item is not coded together with AV data and 
within AV data, but is recorded in the form of separate text data in a separate recording 
space. In addition, on the information storage medium, separate font data for 
rendering subtitle data that is in the form of text data is recorded. Also, 
10 synchronization information for interlocking subtitle data with AV moving pictures for 
which decoding process is finished, and output information for screen output are 
recorded. The subtitle data corresponds to sub picture data in the conventional DVD. 

That is, on the information storage medium according to the present invention, 
the following elements are recorded: 
15 1) AV data (clip) into which video information is compression encoded; 

2) text data for multi-language subtitles; and 

3) font data for rendering text data. 

FIG. 9 is a block diagram of a reproducing apparatus according to a preferred 
embodiment of the present invention. 

20 Referring to FIG. 9, the reproducing apparatus comprises a reader block which 

reads AV data, text data for subtitles, and downloaded font data stored in an information 
storage medium, a decoder block for decoding AV data, a Tenderer block which renders 
text files, and a blender block which synthesizes moving pictures output from the 
decoder block with subtitle data output from the Tenderer block. 

25 In addition, the reproducing apparatus further comprises a buffer block, which 

buffers data between the reader block and the decoder and Tenderer blocks and stores 
determined font data, and may further comprise a storage block for residing font data 
that are stored in advance as defaults. 

In the present invention, rendering means all needed activities related to 
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converting subtitle text data into graphic data so as to be displayed on a display 
apparatus. That is, rendering includes producing graphic data to form a subtitle image 
by repeating the process for finding a font matching with the character code of each 
character in the text data in the downloaded font data read from the information storage 
5 medium or from the residing font data, and converting the font data into graphic data. 
Rendering also includes selecting or converting colors, selecting or converting the size 
of characters, and producing graphic data appropriate to writing in horizontal lines or 
vertical lines. In particular, when the font data being used is an outline font, font data 
defines the shape of each character as a curve formula. In this case, rendering also 
10 includes a rasterizing process for generating graphic data by processing the curve 
formula. 

FIG. 10 is a diagram of the data structure of text data stored in an information 
storage medium according to the present invention. 

Referring to FIG. 10, text data is recorded separately from AV streams. The text 

15 data includes synchronization information, display area information, and display style 
box information. The synchronization information is to be added to data to be output 
with subtitles in a rendering process and then will be used for synchronizing with video 
information which is decoded from AV stream data. The display area information 
designates a location on which rendered subtitle data are displayed on a screen. 

20 Display style box information contains information on the size of characters, writing of 
rendered subtitle data in horizontal lines or in vertical lines, and arrangement, colors, 
contrast, etc., in a display area. In addition, since text data for each of a plurality of 
languages should be written, the text data also contains information indicating a 
language. This multi-language data may be stored in separate spaces for respective 

25 languages, or may be stored in one space after being multiplexed in order of output time 
irrespective of languages. 

FIG. 11 is a preferred embodiment of text data for subtitles according to the 
present invention. 

Referring to FIG. 11, a markup language is used as text data for subtitles in the 
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present embodiment. Considering that the purpose of use is for subtitles, a minimal 
number of tags or elements in the markup language used for subtitles are used, and as 
described above, tags or attributes for synchronization and screen display may be 
included. Here, subtitle, head, meta, body, p elements are shown as examples. In 
5 the present embodiment, information is displayed with an attribute. Attributes used in 
the example are as follows. 

- start: A time at which subtitle data corresponding to moving pictures should be 
output when the start time of the moving pictures that should be reproduced together 
with the subtitle data is set to 0. A time at which subtitles are displayed is expressed in 

10 the form of time (HH): minute (MM): second (SS): frame (FF). The time can be 
expressed in units of 1/1000 second. Also, if video data is MPEG video, the time may 
have a presentation time stamp (PTS) value of video images on which the subtitle 
overlays and is displayed. Generally, the PTS value is a count value operating at 
27MHz or 90kHz. If the PTS value is used, the subtitle data can be accurately 

15 matched with video data and operated. 

- end: A time at which a displayed subtitle disappears and has the same type of 
attribute value as 'start'. 

- position: This indicates the coordinates of the top left-hand vertex in a video 
area in a display area in which subtitle data is to be displayed. 

20 - direction: This indicates the direction of subtitle data to be displayed. 

- size: This indicates the width or height of a display area in which subtitle data is 
to be displayed. If the attribute value of "direction" is "horizontal", a fixed width value of 
a subtitle data box is indicated, and if "vertical", a fixed height value of the subtitle data 
box is indicated. 

25 Among used elements, a subtitle element is used to indicate the root of text data, 

and a head element is used to include a meta element which deals with information 
needed by all of the text data, or a style element which is not shown in the example of 
FIG. 11. In the present embodiment, meta element is used to express the title of the 
corresponding text data and the language to be used. That is, when multiple 



languages are selected, by using meta information in the text data, a desired language 
text file can be conveniently selected. Also, languages can be distinguished by the 
names of text files, or by directory names, if a different directory for each language text 
file is prepared. 

5 Thus stored subtitle data is loaded into the buffer of the reproducing apparatus 

before video data is reproduced, and with the reproduction of video data, the subtitle 
data is converted into graphic data by the renderer block and made to overlap video 
images. Accordingly, the subtitle data is displayed in a predetermined display area at 
an exact time. As described above, for the text data, in addition to the subtitle 

10 character data, control information should also be written in a predetermined format or 
syntax. Accordingly, preferably, the renderer block has a parser function for verifying 
that a text file to be stored is written according to a predetermined syntax. Also, in 
order to synchronize the subtitle data with video images decoded by the decoder block 
by using the synchronization information included in the text file, preferably, there is a 

15 channel through which events for sending or determining information on the reproducing 
time and the reproducing state of the decoder block are exchanged with the decoder 
block. 

FIG. 12 is a diagram of the data structure of text data for a language other than 
the language of FIG. 1 1 . 

20 Referring to FIG. 12, when video data and text data are recorded in different 

areas as this, supporting multiple languages can be achieved by coding the video data 
separately from the subtitle data and then simply adding text data of respective different 
languages to the coded video data. Also, when subtitle data and font data that are not 
stored with video data on the information storage medium are downloaded through 

25 networks or loaded on the reproducing apparatus from an additional information storage 
medium, thus, subtitle data is easily used in other cases. 

When multiple languages are thus supported, a character code to be used for the 
text data should be determined. In the present invention, as a preferred embodiment, 
Unicode is used. Unicode is a character code made to express languages throughout 
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the world with more than 65,000 characters. According to the Unicode, each character 
is expressed by a code point in Unicode. Characters to express respective languages 
are sets of code points having regularly continuous values. The characters having a 
continuous space of code points are referred to as a code chart. Also, Unicode 
5 supports UTF-8, UTF-16, and UTF-32 as coding formats for actually storing or 
transmitting character data, that is, the code points. These formats are to express one 
character by using a plurality of data items with an 8-bit length, 16-bit length, and 32-bit 
length, respectively. 

An ASCII code for expressing English characters and an IS08859-1 code for 

10 expressing languages of European countries by expanding Latin have code point values 
from 0x00 to OxFF in Unicode. Japanese Hirakana characters have code point values 
from 0x3040 to 0x309F. The 11,172 characters for expressing modern Korean have 
code point values from OxACOO to 0XD7AF. Here, Ox indicates that the code point 
value is expressed by hexadecimal numbers. 

15 If subtitle data comprises only English characters, it is preferable that coding is 

performed by using UTF-8. For Korean or Japanese subtitle data, if UTF-8 is used, 
one character should be expressed using 3 bytes. If UTF-18 is used, it can be 
expressed in 2 bytes but each of the English characters included in the subtitle data at 
this time is also expressed in 2 bytes. 

20 Each country has its own character code different from Unicode. For example, 

in the Korean character code set, KSC5601, a Korean character has a 2-byte code 
point value and an English character has 1-byte code point value. If the subtitle data is 
generated by using not Unicode but each nation's character set, each reproducing 
apparatus should understand all of these character sets such that the load for 

25 implementation increases. 

Font data is needed in order to process subtitle data as text data. Also, in order 
to support multiple languages, the font data should support multiple languages. 
However, it is difficult to manufacture all reproducing apparatuses having these fonts 
that support multiple languages. Accordingly, in the present invention, font data only 
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for the characters used in an information storage medium are recorded in the 
information storage medium as subtitle data such that in a reproducing apparatus, such 
font data is loaded into a buffer before reproducing video data and then used. That is, 
the reproducing apparatus links each piece of subtitle text data with font data and then 
5 reproduces the data. Link information of subtitle text data and font data is recorded in 
the text data for subtitles or in a separate area. Considering a case where a user 
executes a language change during reproduction of data, the reproducing apparatus 
loads subtitle data and font data, which correspond to video data and should be 
continuously reproduced before reproduction, and then uses the data. Here, 

10 continuous reproduction means reproduction without cease in the video and audio 
outputs of the video data. Generally, a reproducing apparatus reproduces data by 
storing a predetermined amount of data in a video and audio buffer and if underflow in 
the buffer of the reproducing apparatus is prevented, continuous reproduction is 
possible. When subtitles or font data corresponding to video data should be read 

15 again through the reader block in order to change subtitles during reproduction, if 
underflow of the video and audio data does not occur during the time, loading in 
advance is not needed. 

FIG. 13 is an example of a text file used in the present invention. 

Referring to FIG. 13, in the present embodiment, a style element is used in a 

20 head element in order to use a CSS file format as an application of a style in a markup 
language for implementing a text file. By using CSS, subtitle data can use a variety of 
fonts with different sizes and colors. 

Meanwhile, some users may feel the subtitle styles that are set as defaults are 
not convenient. For example, a person with bad eyesight may feel inconvenience if the 

25 size of the font of the subtitle text is small. Accordingly, it is one of the many important 
issues on how to apply and display a style to satisfy ordinary users or persons with bad 
eyesight when applied to an identical text file. Therefore, by allowing users to 
determine the style, such as the size of a font, through a menu when reproducing an 
information storage medium in a first reproducing apparatus, a style sheet which is for 
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applying a style according to a user's settings and has a plurality of options that can be 
selected by the user can be used. 

In the present invention, an @user rule by which a subtitle style according to a 
user can be set will now be explained. User type is a set of CSS attributes. In the 
5 present embodiment, a detailed distinction of user types, that is, the degree of bad 
eyesight, is not relevant, and therefore, only two cases as follows will be explained. 

- small: a style for a user with normal eyesight 

- large: a style for a user with bad eyesight 

As shown in FIG. 14, subtitles which are preset by using an @user rule or to 
10 which different styles are applied for users with good eyesight or with bad eyesight can 
be displayed. 

Unlike this, it is also possible for a reproducing apparatus to output subtitles with 

applying a different position and size according to the user's preference without using 

the position and size determined by the subtitle data. 
15 FIG. 15 is an example in which the text data for subtitles implemented in FIG. 11 

is displayed on an actual screen. 

Referring to FIG. 15, since in the screen expressed by the second <p> element, 

the width value of the subtitle data display area is fixed to 520 by the "size" attribute, 

subtitle data that cannot be expressed within one line is displayed after changing a line. 
20 As another embodiment, it can be made that subtitle data can be output only in a 

display area and by using a line change element (br), line change can be selected 

forcibly. 

The third <p> element is an example in which by a "direction" attribute, the 
display of subtitle data is vertically performed. 
25 FIG. 16 is an example showing a case where a user executes a language 

change while subtitles in a predetermined language are being reproduced. 

Referring to FIG. 16, when this language change is needed, a reproducing 
apparatus changes subtitle text data being reproduced, links font data corresponding to 
text data, renders data of the changed language, and by doing so, outputs the subtitles. 
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If data for subtitles and font data for this are all loaded in the buffer, continuous 
reproduction of video data can be easily performed. If text data or font data desired to 
be changed is not loaded in the buffer, the data should be loaded into the buffer. At 
this time, cease can occur in reproduction of video data. 
5 For multi-language conversion without cease of video reproduction, the sizes of 

data for subtitles and font data should be limited to less than the sizes of the respective 
buffers. In this case, however, the number of supported languages is restricted. 
Accordingly, in the present invention, this problem is solved by creating a unit referred 
to as a language set. 

10 FIG. 17 is an example of a plurality of language sets of subtitle data and font data 

for multiple languages. 

Referring to FIG. 17, subtitle data and font data for a plurality of languages added 
to one video image are divided into a plurality of language sets. Subtitle data and font 
data that correspond to one language set are limited to a size that is less than the size 

15 of the buffer. After a language set containing subtitle data of a language selected by a 
user or selected as a default by the reproducing apparatus is loaded in the buffer before 
reproducing video data, reproducing video data begins. When the user executes a 
language change, the language change with the subtitle data included in this language 
set can be done without cease because the data is already loaded in the buffer. 

20 However, if a change to a language not included in this language set is made, the 
reproducing apparatus loads again the subtitle data and font data of the desired 
language set. In this case, data of the existing language set is all deleted. At this 
time, in reproducing video data, a cease may occur. Thereafter, if a language change 
is performed, a language change operation is performed again according to the relation 

25 between the language and the language set loaded in the buffer. Information on the 
language set may be recorded on an information storage medium or by considering the 
data stored in an information storage medium and the size of the buffer in the 
reproducing apparatus, and the reproducing apparatus may determine this arbitrarily 
when reproducing data. 



The relation between information needed in reproducing video data and the 
subtitle data will now be explained with a preferred embodiment. 

Clip will be used as a term indicating a recording unit of video data, and PlayList 
and Playltem will be used to indicate reproducing units. 
5 In an information storage medium according to the present invention, AV streams 

are separated and recorded in units of clips. Usually, a clip is recorded in a continuous 
space. In order to reduce the volume, AV streams are compressed and recorded. 
Accordingly, in order to reproduce the compressed AV streams, attribute information of 
the compressed video data should be informed. Therefore, Clip information is 

10 recorded in each clip. Clip information contains audio video attributes of the clip and 
an Entry Point Map in which information on the location of an Entry Point where random 
access is available in each predetermined interval is recorded. In an MPEG which is 
widely used as a video compression technology, the Entry Point is the location of I 
picture where an intra image is compressed, and the Entry Point Map is mainly used for 

15 a time search used to find a point in a predetermined time interval after the starting point 
of reproduction. 

PlayList is a basic unit of reproduction. In an information storage medium, a 
plurality of PlayLists is stored. One PlayList comprises a series of a plurality of 
Playltems. Playltem corresponds to a part of a clip, and more specifically, it is used in 
20 the form by which a reproduction start time and end time in the clip are determined. 
Accordingly, by using Clip information, the location of the part in an actual clip 
corresponding to the Playltem is identified. 

FIG. 18 is a diagram showing correlations of PlayList, Playltem, Clip information, 
and a clip. 

25 Referring to FIG. 18, in addition to PlayList, Playltem, Clip information, and a clip, 

in the present invention, a plurality of text data items for subtitles for each clip is 
recorded in a space separate from the clip. A plurality of data items for subtitles is 
linked to one clip and preferably, this link information is recorded in Clip information. 
To some clips, a plurality of data items for subtitles are linked, but for some clips, no 
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data items or only one data item for subtitles may be linked. When PlayList is 
reproduced, Playltems included in the PlayList are sequentially reproduced. As a 
result, any one of the clips linked to each Playltem and a plurality of subtitles linked to 
the clip are rendered and output. Since continuous reproduction between PlayLists is 
5 usually not guaranteed, it is preferable to load all linked text data for subtitles into a 
buffer before reproducing the PlayList. In FIG. 18, font data is not separately marked. 
Usually, it is preferable that font data is generated for each language. Accordingly, font 
data is recorded in a separate space. 

FIG. 19 is an example of a directory structure according to the present invention. 

10 Referring to FIG. 19, in a directory, clip, Clip information, PlayList, subtitle text 

data, and font data are stored in the form of files and stored in different directory spaces 
according to the respective types. As shown, it is preferable that text files for subtitles 
and font files are stored in directory spaces separate from video data. 

Preferably, an information storage medium according to the present invention is 

15 a removable information storage medium which is not fixed to a reproducing apparatus 
and, only when data is reproduced, can be placed and used. Unlike a fixed information 
storage medium with a high capacity such as a hard disc, the removable information 
storage medium has a limited capacity. Also, reproducing apparatuses for reproducing 
this medium often have a buffer with a limited size and low level function blocks with 

20 limited performance. Accordingly, together with video data recorded on a removable 
information storage medium, only subtitle data and font data used for the subtitle data 
are recorded on the information storage medium and by using the data when video data 
is reproduced from the information storage medium, the amount of data that should be 
prepared in advance can be minimized. A representative example of this removable 

25 recording medium is an optical disc. 

On an information storage medium according to the present invention, video data 
is stored in a space separate from subtitle text data. If this subtitle text data is for 
multiple languages and has font data for outputting the subtitle data, a reproducing 
apparatus loads only the subtitle data and font data in the buffer and then, while 
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reproducing video data, overlaps the subtitle data with a video image and outputs the 
subtitle data. 

FIG. 20 is an example showing a case where a reproducing apparatus outputs 
only subtitle data. 

5 Referring to FIG. 20, a reproducing apparatus may have a function outputting 

only subtitle data. That is, according to one of the many special reproduction functions, 
video data is not reproduced, and only subtitle data that is to be output overlapping the 
video data is converted into graphic and then output. In this case, subtitle data is used 
for learning a foreign language. Here, video data is not overlapped and only subtitle 

10 data is output. Also, both the synchronization information and location information are 
neglected, and the reproducing apparatus outputs a plurality of line data items 
comprising subtitle data on the entire screen, and waits for a user input. After watching 
all of the output subtitle data, the user sends a signal for displaying subtitle data for the 
next line to the reproducing apparatus so as to control the output time of the subtitle 

15 data. 

FIG. 21 is an example showing a case where a reproducing apparatus outputs 
subtitle data for more than one language at the same time. 

Referring to FIG. 21, as a preferred embodiment, a reproducing apparatus may 
have a function for outputting subtitle data for two or more languages at the same time 
20 when subtitle data comprises a plurality of languages. At this time, by using 
synchronization information of subtitle data for each language, subtitle data to be 
displayed on the screen is selected. That is, subtitle data is output in order of output 
start time, and when the output start times are the same, the subtitle data is output 
according to language. 

25 A function, by which while only subtitle data are reproduced, normal reproduction 

of video data can be started from the video data corresponding to a subtitle line data 
item, can also be implemented. 

FIG. 22 is an example showing a case where during reproduction of only subtitle 
data, normal reproduction of video data begins from video data corresponding to subtitle 



line data. 

As shown in FIG. 22, when the user selects one subtitle line data item, a 
reproducing time corresponding to the line data item is selected again, and video data 
corresponding to the time is normally reproduced. 
5 A recording apparatus in the present invention is an apparatus which records 

video data and subtitle data according to the present invention on an information 
storage medium. 

FIG. 23 is a block diagram of a recording apparatus according to the present 
invention. 

10 Referring to FIG. 23, the recording apparatus comprises a central processing unit 

(CPU), a fixed high-capacity storage, an encoder, a subtitle generator, a font generator, 
a writer, and a buffer. 

The encoder, subtitle generator, and font generator may be implemented by 
software on the CPU. 

15 In addition, a video input unit for receiving video data in real time may also be 

included. 

The storage stores a video image that is the object of encoding, or video data 
that is coded by the encoder. In addition, the storage stores a dialogue attached to the 
video data and large volume font data. The subtitle generator receives information on 

20 the output time of a subtitle line data item from the encoder, receives subtitle line data 
from the dialogue data, makes subtitle data for the subtitles, and stores the subtitle data 
in a fixed-type storage apparatus. The font generator generates font data containing 
characters used in the subtitle data for subtitles from the large volume font data and 
stores the font data in the fixed-type storage apparatus. That is, the font data stored in 

25 the information storage medium is part of the large volume font data stored in the 
fixed-type storage apparatus. This process for generating data in the form to be stored 
in an information storage medium is referred to as authoring. 

If the authoring process is finished, coded video data stored in the fixed-type 
storage apparatus are divided into clips, which are the recording units, and recorded on 



an information storage medium. Also, subtitle data for subtitles added to video data 
contained in the clip are recorded in a separate area. Further, font data needed to 
convert the subtitle data into graphic data is recorded in a separate area. 

The video data is divided into reproducing units that should be continuously 
5 reproduced, and usually, this reproducing unit comprises a plurality of clips. As a 
preferred embodiment, the size of subtitle data, which should be overlapped with a 
video image included in one reproducing unit and is output, is limited to be less than a 
predetermined size when the data for a plurality of languages is all added to the subtitle 
data. As another preferred embodiment, subtitle data, which should be overlapped 

10 with a video image included in one reproducing unit, is divided into language sets with 
which a language change should be performed continuously when video data is 
reproduced. Subtitle data included in one reproducing unit comprises a plurality of 
language sets and the size of subtitle data included in one language set, plus data for a 
plurality of languages, is limited to be less than a predetermined size. 

15 The subtitle data comprises character codes using Unicode and, preferably, the 

data form actually recorded is coded by UTF-8 or UTF-16. 

Video data, subtitle data for subtitles, and font data recorded in the fixed-type 
storage apparatus are temporarily stored in the buffer and are recorded on an 
information storage medium by the writer. The CPU executes a software program 

20 controlling each device so that these functions are performed in order. 

[Effect of the Invention] 

As described above, according to the present invention, text data for 
multi-language subtitles are made to be a text file and then recorded in a space 
25 separate from AV streams such that more diverse subtitle can be provided to users and 
recording space arrangement can be conveniently performed. 

Meanwhile, font data for this are made to have a minimum size by collecting 
characters needed for the subtitle text and are stored separately in an information 
storage medium and used. 



27 



What is claimed is: 

1. An information storage medium on which video data is recorded, 
comprising: 

a plurality of clips that are recording units in which the video data are stored; and 
text data for subtitles which are recorded separately from the clips and are to be 
overlapped with an image based on the video data and then output, wherein the text 
data for subtitles includes data for providing subtitles in at least one language. 

2. The information storage medium of claim 1, further comprising: 

font data of characters, which is recorded separately from the clips and for 
graphic expression of the text data and are used in the text data for subtitles. 

3. The information storage medium of any one of claims 1 and 2, wherein the 
text data for subtitles is recorded in a separate space for each language if the text data 
is of multiple languages. 

4. The information storage medium of any one of claims 1 and 2, wherein the 
text data for subtitles includes character data to be converted into graphic data and then 
output and output synchronization information for synchronizing with the video data. 

5. The information storage medium of any one of claims 1 and 2, wherein the 
text data for subtitles includes character data to be converted into graphic data and then 
output and output location information indicating a location in which the graphic data is 
to be displayed when the graphic data is overlapped with an image based on the video 
data. 

6. The information storage medium of any one of claims 1 and 2, wherein the 
text data for subtitles includes character data to be converted into graphic data and then 
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output and information for expressing the output of the graphic data in a plurality of 
sizes when this graphic data is overlaid with a video image. 

7. The information storage medium of any one of claims 1 and 2, wherein the 
5 video data is divided into units that should be reproduced continuously, and the size of 

all of the text data for subtitles corresponding to one unit is limited. 

8. The information storage medium of any one of claims 1 and 3, wherein the 
video data is divided into units that should be reproduced continuously and comprises a 

10 plurality of reproducing units, the text data for subtitles corresponding to the one 
reproducing unit is divided into a plurality of language sets, and the size of all of the text 
data for subtitles forming one language set is limited. 

9. The information storage medium of any one of claims 1 and 3, wherein the 
15 text data forming the text data for subtitles are expressed and recorded in Unicode for 

supporting multi-language character sets. 

10. The information storage medium of claim 9, wherein if the text data for 
subtitles is formed only with characters of ASCII, which is a basic English character set, 

20 or IS08859-1, which is a Latin-extended character set, the text data is coded and 
recorded by using UTF-8 by which one character is coded into a plurality of 8-bit units. 

11. The information storage medium of claim 9, wherein if the text data for 
subtitles includes a character having a code point value of a 2-byte size in Unicode, the 

25 text data is coded and recorded by using UFT-16 by which one character is coded into a 
plurality of 16-bit units. 



12. The information storage medium of claim 1, wherein the information 
' storage medium is a removable type that is easily placed on and removed from a 
reproducing apparatus. 

5 13. The information storage medium of claim 12, wherein the information 

storage medium is an optical disc from which the reproducing apparatus is capable of 
reading data by using an optical apparatus of the reproducing apparatus. 

14. A reproducing apparatus which reproduces data from an information 
10 storage medium on which video data is recoded, wherein on the information storage 

medium, video data is coded and divided into clips that are recording units and recorded 
in a plurality of clips and text data for subtitles that are formed with data of a plurality of 
languages and are overlapped as graphic data with an image based on the video data 
and then output are recorded separately from the clips, the reproducing apparatus 
15 comprising: 

a data reproducing unit which reads data from the information storage medium; 

a decoder which decodes the coded video data; 

a renderer which converts the text data for subtitles into graphic data; 

a blender which overlays the video data with the graphic data to generate an 

20 image; 

a buffer which temporarily stores the video data; and 
a buffer which stores the text data for subtitles. 

15. The reproducing apparatus of claim 14, wherein on the information 
25 storage medium, font data of characters that is used in the text data for subtitles for 

graphic expression of the text data are recorded in a separate space, the reproducing 
apparatus has a buffer for temporarily storing the font data, and the renderer has a 
function for converting the text data for subtitles into graphic data by using the font data. 



16. The reproducing apparatus of any one of claims 14 and 15, wherein if the 
text data for subtitles is data of multiple languages, the text data is' recorded in a 
separate space for each language, and the reproducing apparatus has a function by 
which the text data for subtitles for a language that is selected by a user or is set as an 

5 initial reproducing language in the reproducing apparatus are temporarily stored in the 
buffer; font data for converting the text data for subtitles into graphic data are 
temporarily stored in the buffer for fonts; and, simultaneously, while reproducing video 
data, the text data for subtitles is converted into graphic data and the graphic data is 
output. 

10 

17. The reproducing apparatus of any one of claims 14 and 15, wherein on 
the information storage medium, the text data for subtitles including synchronization 
information, by which the text data for subtitles is converted into graphic data and the 
graphic data is overlapped with an image based on the video data and output, are 

15 recorded, and the reproducing apparatus has a function for controlling an output start 
time and end time of the text data for subtitles by synchronization information. 

18. The reproducing apparatus of any one of claims 14 and 15, wherein on 
the information storage medium, the text data for subtitles, including character data 

20 which will be converted into graphic data and be output, and output location information 
indicating a location where the graphic data is to be output when the graphic data is 
overlapped with an image based on the video and output, are recorded, and the 
reproducing apparatus has a function for controlling a location where the text data for 
subtitles is overlapped with an image based on the video data, by the output location 

25 information. 

19. The reproducing apparatus of claim 16, wherein the video data recorded 
on the information storage medium is divided into units that should be reproduced 
continuously, and within a limited size of all of the text data for subtitles corresponding 
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to the recording unit, the text data is recorded, the reproducing apparatus has a function 
by which all of the text data for subtitles whose size is limited is stored in the buffer 
before reproducing the units that should be reproduced continuously, and if language 
change occurs during reproduction, subtitle data corresponding to the language stored 
5 in the buffer is output. 

20. The reproducing apparatus of claim 16, wherein the video data recorded 
on the information storage medium is divided into units that should be reproduced 
continuously; the text data for subtitles corresponding to one unit is divided into a 

10 plurality of language sets; the text data for subtitles forming the one language set is 
recorded so that all of the text data is limited; and the reproducing apparatus has a 
function, by which the text data for subtitles corresponding to a language set containing 
the subtitle data which should be output simultaneously with video data, is all stored in 
the buffer before reproducing the unit that should be reproduced continuously, and 

15 when a language change occurs during reproduction, if the text data for subtitles for the 
language are in the buffer, the text data for subtitles for the language are output, and if 
the text data for subtitles for the language are not in the buffer, the text data for subtitles 
corresponding to the language set containing the text data for subtitles for the language 
is all stored in the buffer and the text data for subtitles for the language are output. 

20 

21. The reproducing apparatus of any one of claims 14 and 15, wherein on 
the information storage medium, the text data for subtitles containing character data, 
which will be converted into graphic data and be output, and information for indicating 
the output of a plurality of graphic data items when the graphic data is overlapped with 

25 an image based on the video data are recorded, and the reproducing apparatus has a 
function by which the size of the subtitle data is selected among the plurality of sizes in 
the size information by a user and is output. 
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22. The reproducing apparatus of any one of claims 14 and 15, wherein on 
the information storage medium, text data forming the text data for subtitles is 
expressed and recorded in Unicode for supporting multi-language sets, and the 
renderer converts the characters expressed in Unicode into graphic data. 

5 

23. The reproducing apparatus of claim 22, wherein on the information 
storage medium, if the text data for subtitles is formed only with characters of ASCII, 
which is a basic English character set, or IS08859-1, which is a Latin-extended 
character set, the text data is coded and recorded using UTF-8 by which one character 

10 is coded into a plurality of 8-bit units, and the renderer converts the characters 
expressed by UFT-8 into graphic data. 

24. The reproducing apparatus of claim 22, wherein on the information 
storage medium, if the text data for subtitles include a character having a code point 

15 value of a 2-byte size in Unicode, the text data is coded and recorded by using UFT-16 
by which one character is coded into a plurality of 16-bit units, and the renderer 
converts the characters expressed by UTF-16 into graphic data. 

25. The reproducing apparatus of any one of claims 14 and 15, wherein the 
20 information storage medium is a removable type that is easily placed on and removed 

from a reproducing apparatus, and the reproducing apparatus reads data from and 
reproduces data on the removable information storage medium. 

26. The reproducing apparatus of claim 25, wherein the information storage 
25 medium is an optical disc from which the reproducing apparatus is capable of reading 

data by using an optical apparatus of the reproducing apparatus, and the reproducing 
apparatus reads and reproduces data recorded on the optical disc. 



27. The reproducing apparatus of any one of claims 14 and 15, wherein the 
reproducing apparatus has a function for converting only the subtitle data into graphic 
data and outputting the graphic data without reproducing video data recorded on the 
information storage medium. 

5 

28. The reproducing apparatus of claim 27, wherein the reproducing 
apparatus has a function for reading and outputting subtitle data for more than one 
language if on an information storage medium, text data for subtitles of a plurality of 
languages is recorded. 

10 

29. The reproducing apparatus of any one of claims 27 and 28, having a 
function by which during output of the text data for subtitles, reproduction of video data 
begins from video data synchronized with corresponding subtitle data and subtitle data 
are overlapped with a video image and then output. 

15 

30. A recording apparatus which records video data on an information storage 
medium, comprising: 

a data writer which writes data on the information storage medium; 
an encoder which codes the video data output from the data writer; 
20 a subtitle generator which generates subtitle data that will be added to the video 

data and output; 

a central processing unit (CPU); 
a fixed-type storage; and 

a buffer, wherein the encoder stores video data in the fixed-type storage after 
25 dividing video images into clips that are recording units and compression encoding the 
clips; the subtitle generator generates subtitle data for a plurality of languages in the 
form of a text, the subtitle data that should be reproduced together with an image based 
on the video data, and stores the subtitle data in the fixed-type storage; the buffer 
temporarily stores the data stored in the fixed-type storage; the data writer records the 
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coded video data and subtitle data that are temporarily stored in the buffer in an 
information storage medium; and the CPU controls the encoding process of the video 
data, the process for generating the subtitle data, and a process for recording the coded 
video data and the text data for subtitles in respective separate areas on an information 
5 storage medium. 

31 . The recording apparatus of claim 30, further comprising: 

a font data generator which generates font data for converting text data for 
subtitles into graphic data, wherein the font data generator generates font data needed 

10 for converting the text data for subtitles into the form of graphic data, and stores the font 
data in the fixed-type storage, the buffer temporarily stores the font data stored in the 
fixed-type storage, the data writer records the font data temporarily stored in the 
fixed-type storage on information storage medium, and the CPU controls the process for 
generating the font data and a process for recording the font data in a separate area of 

15 the information storage medium. 

32. The recording apparatus of any one of claims 30 and 31 , wherein the CPU 
controls the text data for subtitles to be recorded in a separate space for each language 
if the text data is data of multiple languages. 

20 

33. The recording apparatus of any one of claims 30 and 31, wherein by 
including character data to be converted into graphic data and then be output and 
output synchronization information for synchronizing with reproduction of the video 
images, the subtitle generator generates the subtitle data. 

25 

34. The recording apparatus of any one of claims 30 and 31, wherein by 
including character data to be converted into graphic data and then be output and 
output location information indicating a location where the graphic data will be output 
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when the graphic data is overlapped with an image based on the video data, the subtitle 
generator generates the text data for subtitles. 

35. The recording apparatus of any one of claims 30 and 31, wherein by 
5 including character data to be converted into graphic data and then be output and 
information for expressing the output of the graphic data with a plurality of sizes when 
the graphic data is overlapped with an image based on the video data, the subtitle 
generator generates the subtitle data. 

10 36. The recording apparatus of any one of claims 30 and 31, wherein the 

coded video data is divided into recording units that should be reproduced continuously, 
and the subtitle generator generates the text data for subtitles such that the size of all of 
the subtitle data corresponding to the recording unit is limited. 

15 37. The recording apparatus of any one of claims 30 and 31, wherein the 

coded video data is divided into recording units that should be reproduced continuously, 
and after the text data for subtitles corresponding to the recording unit are divided into a 
plurality of language sets, the subtitle generator generates the text data for subtitles 
such that the size of the entire subtitle data forming the one language set is limited. 

20 

38. The recording apparatus of any one of claims 30 and 31, wherein the 
subtitle generator generates text data forming the text data for subtitles in Unicode for 
supporting multi-language character sets. 

25 39. The recording apparatus of claim 37, wherein the subtitle generator 

generates performs encoding by using UTF-8 by which one character is coded into a 
plurality of 8-bit units if the text data for subtitles are formed only with characters of 
ASCII, which is a basic English character set, or IS08859-1, which is a Latin-extended 
character set. 
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40. The recording apparatus of claim 37, wherein the subtitle generator 
performs encoding by using UFT-16 by which one character is coded into a plurality of 
16-bit units if the text data for subtitles includes a character having a code point value of 
a 2-byte size in Unicode. 

41 . The recording apparatus of any one of claims 30 and 31 , wherein the data 
writer records data on an information storage medium which is a removable type that is 
easily placed on and removed from the recording apparatus. 

42. The recording apparatus of claim 41, wherein the data writer records data 
on an optical disc on which data is recorded by using an optical apparatus. 
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